Experiments on Speech/Music Discrimination
نویسنده
چکیده
The problem of speech/music discrimination has become increasingly important as automatic speech recognition system are applied to more real-world multimedia domains. One of the issue in the design of a signal classifier is the selection of an appropriate feature set that captures the temporal and spectral structures of the signal. Many features have been used in speech/music discrimination. The cepstral coefficients and the posterior-based features are two widely used features. Another issue is the selection of a classification algorithm, and a popular approach to speech/music discrimination is to use the decorrelated feature frames to train distribution models to distinguish training data labelled as speech or music [4, 10]. In this report, we test the performance of three kinds of classic features (MFCCs, Dynamic features, and the posterior-based features) and two kinds of classic distribution models (GMM and K-nearest neighbor) in separating speech and music signals. Furthermore, we implement a demo to classify a given audio signal into speech or music category.
منابع مشابه
Robust singing detection in speech/music discriminator design
In this paper, an approach for robust signing signal detection in speech/music discrimination is proposed and applied to applications of audio indexing. Conventional approaches in speech/music discrimination can provide reasonable performance with regular music signals but often perform poorly with singing segments. This is due mainly to the fact that speech and singing signals are extremely cl...
متن کاملRobust Features for Effective Speech and Music Discrimination
Speech and music discrimination is one of the most important issues for multimedia information retrieval and efficient coding. While many features have been proposed, seldom of which show robustness under noisy condition, especially in telecommunication applications. In this paper two novel features based on real cepstrum are presented to represent essential differences between music and speech...
متن کاملA Novel Speech/Music Discrimination Using Feature Dimensionality Reduction
In this paper, we propose an improved speech/music discrimination method based on a feature combination and dimensionality reduction approach. To improve discrimination ability, we use a feature based on spectral duration analysis and employ the hierarchical dimensionality reduction (HDR) method to reduce the effect of correlated features. Through various kinds of experiments on speech and musi...
متن کاملRobust speech music discrimination using spectrum's first order statistics and neural networks
Most of speech/music discrimination techniques proposed in the literature need a great amount of training data in order to provide acceptable results. Besides, they are usually context-dependent. In this paper, we propose a novel technique for speech/music discrimination which relies on first order sound spectrum’s statistics as feature vector and a neural network for classification. Experiment...
متن کاملA wavelet-based parameterization for speech/music discrimination
This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the co...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005